AMENDMENT TO THE CLAIMS 



1-6. (Cancelled) 



7. (Currently Amended) A method of learning with an automatic speech recognition system, 
the method comprising: 

detecting a change to dictated text; 

inferring whether the change is a correction or editing; 

wherein inferring whether the change is a correction or editing includes comparing a 
speech recognition engine score of the dictated text and of the changed text; 
if the change is inferred to be a correction, selectively learning from the nature of the 

correction without additional user interaction; 
wherein selectively learning from the nature of the correction includes: 

determining if a user's pronunciation deviated from existing pronunciations 
known by the system by doing a forced alignment of a wave based on at 
least one context word, wherein determining if the user's pronunciation 
deviated from existing pronunciations also includes identifying in the 
wave the pronunciation of the corrected word; and 
determining if the corrected word exists in the user's lexicon, and if the corrected 
word does exist in the user lexicon, selectively changing m-HMM-a 
parameter associated with the pronunciation; 
building a lattice based upon possible pronunciations of the corrected word and the 
recognition result; 

generating a confidence score based at least in part upon the distance of the 
pronunciation with the possible pronunciations; and 

wherein the confidence score is calculated using the function l/[d/f/log(lenl+len2)], 

where d is the distance between the recognized pronunciation and a best match in 
a lexicon, f is a frequency that the same pronunciation is pronounced, and lenl 



-3- 



and len2 are the lengths of phonemes in a new pronunciation and the closest 
pronunciation, respectively. 

8-17. (Cancelled) 

18. (Previously Presented) The method of claim 7, and further comprising generating a 
confidence score based at least in part upon an Acoustic Model score of the pronunciation with 
the possible pronunciations. 

19, (Previously Presented) The method of claim 7, wherein selectively learning the 
pronunciation includes comparing the confidence score to a threshold. 

20-31. (Cancelled) 

32. (Previously Presented) The method of claim 7, wherein the distance is calculated using a 
phone confusion matrix and Dynamic Time Warping. 

33-36. (Cancelled) 

37. (Currently Amended) A method of learning with an automatic speech recognition system, the 
method comprising: 

detecting a change to dictated text; 

inferring whether the change is a correction or editing; 

wherein inferring whether the change is a correction or editing includes comparing a 
speech recognition engine score of the dictated text and of the changed text; 

if the change is inferred to be a correction, selectively learning from the nature of the 
correction without additional user interaction; 

wherein selectively learning from the nature of the correction includes: 



determining if a user's pronunciation deviated from existing pronunciations 

known by the system by doing a forced alignment of a wave based on at 
least one context word, wherein determining if the user's pronunciation 
deviated from existing pronunciations also includes identifying in the 
wave the pronunciation of the corrected word; and 

determining if the corrected word exists in the user's lexicon, and if the corrected 
word does exist in the user lexicon, selectively changing an HMM a 
parameter associated with the pronunciation; 
generating a confidence score based at least in part upon the distance of the 

pronunciation with the possible pronunciations; and 
wherein the confidence score is calculated using the function l/[d/f/log(lenl+len2)], 

where d is the distance between the recognized pronunciation and a best match in 

a lexicon, f is a frequency that the same pronunciation is pronounced, and lenl 

and len2 are the lengths of phonemes in a new pronunciation and the closest 

pronunciation, respectively. 

38. (Previously Presented) The method of claim 37, wherein selectively learning the 
pronunciation includes comparing the confidence score to a threshold. 



